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Directly imaging damped Lyman-a galaxies at z > 2. 
I: Methodology and First Results*. 
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ABSTRACT 

We present the methodology for, and the first results from, a new imaging programme aimed 
at identifying and characterizing the host galaxies of damped Lyman-a absorbers (DLAs) at 
z > 2. We target quasar sightlines with multiple optically-thick Hi absorbers and use the 
higher-redshift system as a "blocking filter" (via its Lyman-limit absorption) to eliminate all 
far-ultraviolet (FUV) emission from the quasar. This allows us to directly image the rest-frame 
FUV continuum emission of the lower-redshift DLA, without any quasar contamination and 
with no bias towards large impact parameters. We introduce a formalism based on galaxy 
number counts and Bayesian statistics with which we quantify the probability that a candidate 
is the DLA host galaxy. This method will allow the identification of a bona fide sample of 
DLAs that are too faint to be spectroscopically confirmed. The same formalism can be adopted 
to the study of other quasar absorption line systems (e.g. Mg n absorbers). We have applied this 
imaging technique to two QSO sightlines. For the z ~ 2.69 DLA towards J073 149+285449, 
a galaxy with impact parameter b = 1.54" = 11.89 h-^ 1 kpc and implied star formation rate 
(SFR) of ~ 5 hy 2 2 M Q yr _1 is identified as the most reliable candidate. In the case of the 
z ~ 2.92 DLA towards J21 1444-005533, no likely host is found down to a 3cr SFR limit of 
1.4 M Q yr _1 . Studying the Hi column density as a function of the impact parameter, 
including 6 DLAs with known hosts from the literature, we find evidence that the observed 
H i distribution is more extended than what is generally predicted from numerical simulation. 

Key words: Method: statistics - Galaxies: high-redshift - Galaxies: quasars: absorption lines 
- Galaxies: quasars: individual: J21 1444-005533 and J073 149+285449. 



1 INTRODUCTION 

Absorption lines detected along the line of sight to quasi-stellar ob- 
jects (QSOs) and gamma ray bursts (GRBs) can be used to glean 
the properties of the intergalactic medium (IGM) and the inter- 
stellar medium (ISM) at high redshift. Before the advent of large 
millimeter and radio arrays such as the Atacama Large Millime- 
ter Array (ALMA) or the Square Kilometre Array (SKA), the only 
available way to characterize the physical properties of the differ- 
ent gas phases in the high redshift Universe is through the analysis 
of hydrogen, metal, and molecular absorption lines. Although the 
sizes of the regions explored through the background QSO beam 
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are too narrow to provide a detailed picture of individual objects, 
large spectroscopic surveys of QSOs across the sky enable the study 
of the absorbers as a population. This can lead to profound insights 
on the gas properties in high-redshift galaxies, crucial to constrain 
models of galaxy formation and evolution. 

One of the most well-studied classes of absorbers are the 
damped Lya absorbers (DLAs). With an Hi column density Nh 
2 x 10 20 cm -2 , t he DLAs contain mo st of the neutral gas in the 
Universe at z ~ 3 (O'Me ara et al .120071) . Also, by being associated 
with high gas overdensities in the cosmic web, DLAs are intimately 
connected with galaxy formation at high redshifts. 

Besides the actual identification of DLAs, absorption spec- 
troscopy can provide detailed information about the Hi column 
density distribution of the absorbers, their chemical composition 
and kinematics, as well as the physic al state of the neutral hydrogen 
(see the review bv lWolfe et aiB oQ5). As a result of several decades 
of observations of DLAs, the distribution of neutral hydrogen in 
the Universe an d its evolution with r e dshift is well constrained 
at high redshift jProchaska etai]|2005l ; IProchaska & Wolfell2009l; 
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iNoterdaeme et al.ll2009l) . Still, pencil beam surveys yield only a 
limited picture of the morphology of DLA galaxies. In turn, this 
limits the utility of the absorbers for studying galaxy assembly and 
evolution. 

While at low re dshift (z < 1), DL As are clearly associ- 
ated with galaxies (e.g. lZwaan et alj2005al and references therein), 
the nature of high red shift DLAs is still uncertain. Since their 
discovery dWolfe et al.lll986t) . the absorbers have often been as- 
sociated with massive disks , as suggested by veloci ty profiles 
of low-ion metal transitions (Prochaska & Wolfel ll997l) and con- 
sistent with recent findings that massive thick disks with typi- 
cal rotational velociti es up to 200 km/s are already in place at 
redsh ift z ~ 2 - 3 l lGenzel et aH 120061 : iFdrster Schreiber et al.l 
2009). However, the abundance patterns in DLAs indicate star for- 
mat ion histories more similar to those of dwarf irreg ular galax- 
ies {Dessauges-Zavadsky, Calura, Prochask a et al . 2007), while the 
elusive nature of the DLA galaxies hints towards a population of 
low surface-brightness systems. 

From a theoretical point of view, smoothed particle hydrody- 
namic (SPH) simulations which include gas physics are able to re- 
produce most of the observed DLA proper ties within a cold dark 
matte r (CDM) formulation (see, however, Ijedamzik & Prochaskal 
Il998h . Although the results may depend on the treatment of feed- 
back and winds, there is general agreement that the major contri- 
bution to the DLA cross-section at z ~ 3 comes from low- and 
intermediate-mass halos with 10 9 < M vil /M Q < 10 12 (see 
also iBarnes & Haehneltll2009l) . This is consisten t with the value 
M vir = 10 11 2 M inferred bv lCooke et all feOOrj) from the clus- 
tering of DLAs and Lyman-break galaxies (LBGs). Nevertheless, 
the debate around DLAs has not yet been settled. In fact, sim- 
ulations tend to predict small impact parameters, suggesting that 
DLAs are more compact at high redshifts than modern disk galax- 
ies. But this caus es simulations to unde r-predict the observed rate 
of incidence (e.g. Nagamine et al. 2007) or the number of high ve- 
locity absorbers jPontzen et al. 2008). Several mechanisms such as 
tidal -streams, outflows (e.g . lSchavel2001r) or filamentary structures 
(e.g.|Razoumov et al. 2006) and cold flows pen etrating inside mas- 
sive halos feres et alj2005l ; lDekel et alj2009l) can provide a larger 
cross-section for DLA gas. More quantitative analysis of adaptive 
mash refinement (AMR) simulations are ongoing to understand if 
gas overdensities inside these more extended structures can repro- 
duce the spectrum of kinematics observed in DLAs, as well as the 
incidence of the absorbers. 

To identify which one, or which combination, of the above 
scenarios applies to DLAs requires direct imaging of the galaxies 
responsible for the absorption. Unfortunately, this task is partic- 
ularly difficult at optical wavelengths due to the bright emission 
of the background quasar. In the past years, several attempts have 
been made in this directiorQ, typically by inspecting the resid- 
ual images after subtracting out the quasar light. However, the 
galaxy counterparts of these absorbers are expected to be faint 
and probably at low i mpact parameters (e.g. IWolfe&Cher] r2006; 
Nagamine et al. 2007). Therefore, imperfecti ons of the quasar sub- 
tracti on are a challenge to such studies (see lKulkarni et ai1l2000l 
1200 lh . As a result, only six spectroscopically- confirmed galaxy 
counterparts are currently known at z > 1.9 jM0ller & Warred 
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Figure 1. The SDSS spectrum of QSO J073 149+285449 (black histogram). 
Also labeled are the Lya and Lyman-limit wavelengths of two absorbing 
systems along the sightline: (i) a Lyman-limit system (LLS) at z;; s = 3.55 
and (ii) a DLA at z,n a = 2.69. The yellow shaded region corresponds to 
the wavelength range where the rest-frame FUV emission from the lower- 
z DLA can be detected without quasar contamination. The n-band (blue 
dashed line) and V-band (green dash-dotted line) LRIS filter transmission 
curves are overplotted to show that the quasar light is fully absorbed by the 
LLS in the u-band image. 



See Appendix [5] for a review of previous studies aimed at identifying 
DLA galaxies. 



To overcome these limitations, new techniques are being ex- 
plored. Surveys based on adaptive optics and improved modelling 
of the QSO point spread function (PSF) can minimise the impact of 
the quasar light on nearby objects, although some regions at very 
small impact parameters may still not be accessible. Narrow-band 
images from integral field unit (IFU) observations have the great 
advantage of providing both spatial and redshift information at the 
same time. Unfortunately, current instruments do not provide very 
high sensitivity at the short wavelengths needed to detect the Lya 
line at z ~ 2 (see lChristensen et alj|2007b . A very promising tech- 
nique to image high-z absorbers was proposed by lO'Meara et al.l 
(2006), who considered imaging Mg II absorbers at z ~ 2. The ba- 
sic idea, presented in more detail in section [2] consists of imaging 
QSO sightlines with two known high column density absorbers. 
The higher-redshift absorber can then act as a natural filter to block 
the quasar light, so that the rest-frame far-ultraviolet (FUV) emis- 
sion of the lower-redshift DLA can be detected without any con- 
tamination from the quasar. 

This paper, the first of a series, presents initial results from a 
new survey to image DLAs at 2 = 2 — 3, using the above technique. 
In section [2] we discuss the target selection criteria, in section [3] 
we describe the observations of two quasar fields, the data reduc- 
tion procedure, and our results, while, in sections|4]and|5] we focus 
on different methods to identify the galaxy counterparts. Analysis 
and discussion follow in sections [6] and [7] while section [8] summa- 
rizes our present results and considers prospects for the future. We 
adopt a A-cold-dark-matter (ACDM) cosmology throughout this 
paper, with Q, m = 0.3, Oa = 0.7, and H = 72 km s _1 Mpc -1 . 
All lengths are proper distances unless otherwise stated. Physical 
quantities are computed including the Hubble constant, in units of 
h T2 = 0.72. 



Imaging DLAs at z > 2 3 



2 SURVEY DESIGN 

The selecti on criteria for our tar gets are based on an updated ver- 
sion of theloi Mearaetal method, used to eliminate quasar 
contamination. We search among all the known QSOs with a fore- 
ground DLA in the Sloan Digital Sky Survey (SDSS) that also 
harbour a higher-redshift Lyman-limit system (LLSfl By requir- 
ing Nhi> 10 18 cm -2 for the LLS, we only include absorbers that 
are highly optically thick (r > 10) to Lyman continuum photons. 
This configuration of two absorbers allows us to use the higher- 
redshift absorber to completely block the quasar light, allowing 
the FUV emission of the lower-redshift DLA to be imaged with- 
out any quasar contamination or source confusion from the QSO 
host galaxy. 

An example is provided in FigureQ] where we show the SDSS 
spectrum of the QSO J073 149+285449. For illustrative purposes, 
the LRIS3 u and V filter transmission curves are superimposed 
with blue dashed and green dash-dotted lines, respectively. The 
quasar spectrum exhibits an LLS at zu s = 3.55 with a correspond- 
ing Lya absorption line at ~ 5500A and a Lyman limit (LL) at 

= 912 Ax(l + z Us ) ~ 4150A. In addition, Lya absorption 
from a lower redshift DLA can be seen at ~ 4500A, with an asso- 
ciated Lyman limit at X^la = 912 Ax(l + z dla ) ~ 3400A. The 
higher-redshift LLS entirely absorbs the quasar light at A < Xjfa , 
as seen in the spectrum. This allows the lower-redshift DLA to be 
imaged in filters covering wavelengths blueward of A^ s . 

A comparison between the u-band and i?-band images of 
two such fields, containing the quasars J21 1444-005533 (top) and 
J073 149+285449 (bottom), is shown in Figured The QSOs are 
visible only in the _R-band images (the left panels of the figure), 
while they are fully absorbed by the higher-redshift LLSs in the u- 
band images (the right panels). This allows the detection of faint 
lower-redshift galaxies at all impact parameters. It is useful to note 
that, even if the LLS is associated with a star-forming galaxy, it is 
unlikely to be visible in the u filter unless the galaxy has a very 
high escape fraction. Moreover, the UV light from the LLS will 
eventually recover from the absorption at A <C Ajjs , but without a 
significant contribution in the u-band. This prevents confusion for 
the identification of the lower- z DLA. The LLS could be detected 
in the V- or i?-band imaging as a Lyman-break galaxy, unless it is 
projected onto the quasar. 

Next, since the DLA has its own Lyman limit, its FUV emis- 
sion can be detected only in the wavelength interval 

A^ L a = 912 Ax (1 + z dla ) < A < A{£ = 912 Ax(l + z lls ) , (1) 

highlighted with a yellow shaded area in Figure [TJ Therefore, it 
is strategic to impose the selection criterion that Zdia <C zu B , so 
as to maximise the emission from the DLA in the u-band filter. 
Conversely, a lower limit on the DLA redshift is imposed by the 
condition 1215 Ax (1 + z dla ) > 912 Ax (1 + z ns ), i.e. the DLA 
Lya line has to be visible in the s pectrum. The latter im proves upon 
the original selection criterion of lO'Meara et al.l 12006), who relied 
on strong metal lines (e.g. Mg n, Fe n, etc ) to infer the presence o f 
a high column-density absorber (see also lChristensen et alj [2009). 
While strong Mg II absorb ers and DLAs ar e often considered highly 
overlapping populations dRao et alj[l995l) . only ~ 35% of strong 
Mg II absorbers have been found to have a column density above 

2 We refer to the second absorber as an LLS to make clear the distinction 
with the target DLA, at a lower redshift, that is to be imaged. However, the 

higher-redshift absorber can also be a DLA. 

d Low Resolution Imaging Spectrometer at Keck I lOke etalj 1995b . 



the DLA limit dRao et ail l2006). An additional advantage of our 
approach is that the Hi column density for our targets is directly 
measurable from the Lya absorption line, implying that it is pos- 
sible to determine the metallicity and relative abundances in the 
absorber's ISM along the QSO sightline. The downside is that we 
impose another constraint on the redshift separation of the two ab- 
sorbers which limits the number of possible targets. Finally, two 
additional conditions set the absolute redshift space that we can 
probe with our survey using current technology. An upper limit 
at Zdia ~ 3.5 is imposed by the absorption from the intergalac- 
tic medium (IGM). In fact, at higher redshifts the blanketing ef- 
fect of the IGM starts affecting the emission from the DLA galaxy, 
lowering the chance of a detection. Conversely, a lower limit at 
Zdia ~ 2.1 is imposed by the target selection using SDSS and, 
more generally, by the use of optical rather than UV facilities. We 
note, finally, that the short wavelength imaging can be carried out 
with either ground- or space-based facilities. 

At first, the requirement that two absorbers should lie in a nar- 
row range of redshifts along a single sightline may suggest that 
we will be able to target only a few systems in this particular spa- 
tial configuration. However, among the ~ 1000 D LAs known at 
2 > 2.1 from the SDSS (DR5; |Prochaska et alj2005h . ~ 140 sight- 
lines meet our selection criteria. Therefore, the proposed technique 
is a promising way of obtaining a large sample of DLAs for a sta- 
tistical study of the emission properties of the host galaxies. Note 
that it is important to restrict the wavelength range that is imaged to 
the region between the Lyman limits of the two DLAs, to minimize 
both the leakage from the QSO and the sky emission at A < A^. 
Using a tunable medium-band filter would be ideal for this project, 
but such filters are not typically available on large telescopes. To 
isolate a first sample of high-priority targets, we require that the 
broad-band filters that are currently available overlap with the FUV 
visibility window defined by Eq. <[TJ . With these additional con- 
straints, we have selected a sample of ~ 40 sightlines, 20 of which 
will be imaged with Wide Field Camera-3 (WFC3) on the Hubble 
Space Telescope (HST0, and ~ 20 with ground-based facilities. 

In summary, with this survey we aim to increase the num- 
ber of known host galaxies of high-z DLAs, over a wide range 
of both redshift (z = 2 — 3.5) and Hi column density (Nhi= 
2 x 10 20 — 7 x 10 21 cm -2 ). Since any bias towards large impact 
parameters is completely removed, even non-detections of DLA 
emission will provide interesting constraints on the star formation 
rates (SFRs) of the absorbers. While such non-detections might 
have been attributed to the quasar glare in previous studies, our 
technique will yield robust upper limits on the DLA luminosities. 



3 OBSERVATIONS 

We have applied the technique described in the preceding section 
to two QSOs, J2 11444-005533 and J073 149+285449, each with 
two high H I column density absorbers along the sightline. Details 
concerning the quasars and the absorbers are provided in Table [TJ 
The last column lists the fraction of the u-band filter transmission 
g{\) that covers the FUV window Xah < A < Aj^ in which the 
DLA can be imaged: 



4 The HST-WFC3 observations have been scheduled for the ongoing Cycle 
17, proposal ID 11595. 
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Figure 2. Keck u- and ij-band imaging of the fields J21 1444-005533 (top) and J073 149+285449 (bottom). The quasars are visible only in the ij-band images 
(left panels) because the intervening LLS completely absorbs the light in the u-band (right panels). Therefore, faint galaxies in the foreground of the LLS can 
be detected even if they are spatially coincident with the quasar. Galaxies detected in the u band are labeled as in Table i*[T] while the QSO position is marked 
with a circle of 0.5" in radius. The solid lines are 2" long (~ 15 h^ kpc at z = 3). Galaxy F in the field J21 1444-005533 is the only object visible in the 
SDSS images. 



J LL 3(A)dA 

f(FUV) = -^p . (2) 

/ ff(A)dA 
Jo 

In the case of J21 1444-005533, the "blocking" absorber is a system 
at z ~ 3.44, associated with the quasar, while the target absorber is 
a super Lyman-limit system (SLLS; or a sub-DL^, at z ~ 2.92. 
Conversely, for J073 149+285449, the blocking absorber and the 
intervening DLA are at z ~ 3.55 and z ~ 2.69, respectively. 



Imaging of the fields of J21 1444-005533 and 
J073 149+285449 was obtained at Keck I using LRIS. The 
first field was observed in October 2008, during a photometric 
night, and the second field in January 2009, during a stable but non- 
photometric night. A set of short exposures for J073 149+285449 
were subsequently acquired in a photometric night for flux 
calibration. The blue side of LRIS is equipped with a 2x2Kx4K 
back-side-illuminated Marconi CCD with a plate scale of 0.135 
" pix~ . Before June 2009, a 2Kx4K front-side-illuminated Tex- 
tronic CCD with a plate scale of 0.211 " pix -1 was in operation 
on the red side. 



5 J21 1444-005533 has been included in our sample since the measured 
value of column density together with the associated error places this object 
at the edge of the DLA classification. 
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Table 1. A summary of the properties of the absorbers detected towards J21 1444-005533 and J073 149+285449, and of the quasars themselves. Quasar 
magnitudes in the r band are from the SDSS. Also listed are the observed wavelengths of the Lyman limit of the LLS (Aj^), and the fractions of the filter 
covered in the FUV window, as defined in the main text. 
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J073 149+285449 


3.68 


18.47 


3.55 


4152 


2.69 


20.55 


0.73 



3.1 Imaging 

We acquired multiple exposures for each target, dithering ~15" 
to remove CCD defects in the final image. A summary of the ob- 
servations is in Table [2] By splitting the incoming light through a 
dichroic mirror (50% transmission at 4874A), R-, V- and /-band 
images were obtained for J21 1444-005533, simultaneous with the 
it-band exposures. During the observations of J073 149+285449, 
water vapor condensed on the window of the red-side camera, pro- 
ducing a halo around the quasar (see bottom-left panel of Figure 
[2). For this target, besides the it-band image, we hence only ac- 
quired R- and l/-band images, which have limited value. Obser- 
vations were taken close to the meridian in order to minimise the 
atmospheric extinction. Seeing conditions were good (FWHM~ 
0.6" — 0.8" in the u band). The data were reduced following stan- 
dard procedures. After the bias subtraction, we applied twilight flats 
and then averaged background-subtracted exposures after scaling 
them to a common zero. A weight proportional to the background 
variance was adopted for the stacking. 

Photometric calibrations were obtained by observing mul- 
tiple photomet ric standard st ars in the fields PG2213— 006 and 
PG09 18+029 dLandoltl [1992T) . A photometric zero-point in AB 
magnitude was fitted together with a color term, assuming fixed 
air mass coefficients typical for the atmosp here in Mauna Ke a 
(0.41, 0.12, 0.11 and 0.07 for u, V, R, and 7; ICooke et afll2005h . 
For galaxies with fluxes affected by the quasar emission in all fil- 
ters besides it-band, we set the color term to zero, assuming a 
flat continuum typical of star-forming galaxies. Uncertainties on 
the final zero-point are between ~ 0.05 and ~ 0.02 mag. for 
J21 1444-005533, while between ~ 0.07 and ~ 0.04 mag. for 
J073 149+285449. The higher uncertainty for J073 149+285449 is 
due to the intermediate step required to extrapolate the zero-point 
from shallow exposures acquired in photometric conditions. Cor- 
rections for G alactic extinction (Tab le[2) are computed from the far- 
IR dust map of lSchlegel et alJdl998D . The extinction A in each filter 
is computed as A(u) = A.8E(B - V), A(V) = 3 1E(B - V) 
A(R) = 2.3E(B - V), and A{I) = 1.5E(B - V) dCardelli et al.1 
1989). Under good seeing conditions, a total Keck-LRIS exposure 
time of ~ 90 min. enables a depth of ~ 29 mag. at la for a 1" aper- 
ture in the it-band images. This sensitivity allows the detection of 
a star formation rate of ~ 1.5 h^ 2 M© yr _1 at 3a significance for 
a z — 3 target, once we correct for IGM absorption (see Sect.[6](. 
Exposure times and depths in each filter are listed in Table|2] 

3.2 Photometry 

Candidate host galaxies we re selected from the it- band images us- 
ing the SExtractor package ( Bertin & Arnoutsll99q) . The detection 
threshold was set to 1.4a with a minimum area of 3 pixels; these 
parameters force the inclusion of faint sources. We include in the 
final catalogue only galaxies within a projected angular distance of 
b < 12" from the quasar (corresponding to a proper distance of 




Figure 3. 7?-band image of J21 1444-005533, after quasar subtraction using 
standard PSF modelling techniques. Galaxies detected in the u band are 
labeled as in Table ,A[T] 

~ 90 h^2 kpc at z = 3). This search area is slightly larger than the 
maximum impact parameter of an absorber (~ 10"), as inferre d 
from absorption line statistics dStorrie-Lom bardi & Wolfe |2000|) . 
Since the region under consideration is small, we can inspect the 
segmentation maps to clean the catalogue of spurious detections or 
to include undetected sources, if any. 

Integrated magnitudes are computed within Rron-like ellipti- 
cal apertures. The background is subtracted locally, measuring the 
sky mean value in a square box 40 pixels on a side, centered on 
the target. All the pixels flagged as belonging to an object are first 
masked, and the sky variance is added to the Poisson error from the 
source to compute the uncertainty on the flux measurement. The 
final uncertainty also includes the error on the photometric cali- 
bration. To test the accuracy of the photometry, we have simulated 
Keck u-band obser vations for di fferent seeing conditions with the 
software SkyMake (Bertin 2009). For good seeing (0.6"), we are 
able, on average, to recover > 95% of the total flux at la down to 
u = 25 mag, while, for the faintest magnitudes (it > 26 mag), this 
fraction approaches ~ 90%. In all cases, the total flux is fully re- 
covered within 2a significance. For worse seeing conditions (1.0"), 
the fraction of recovered flux drops slightly, as expected. Due to the 
increase in the uncertainty, the fraction of flux recovered at la re- 
mains constant. 

An additional source of uncertainty comes from possible leak- 
age of the quasar flux. Although m s 3> 1 for the Hi column den- 
sities of the higher- z LLSs, we conservatively test for any possible 
contamination from the quasar by comparing the surface bright- 
ness in a box centered on the quasar region with a local sky de- 
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Table 2. Log book of the imaging observations taken at Keck-LRIS. 



Field R.A. Dec. UTDate Filter Exp. Time FWHM la depth E(B-V) 

(J2000) (J2000) (s) (") (mag in 1" ap.) (mag) 

J21 1444-005533 21:14:43.9 -00:55:32.7 2008 Oct, 2 nd u 6x900 0.6 29.20 0.062 

V 6x220 0.6 28.22 
R 6x220 0.6 27.99 
i" 6x245 0.6 27.59 

J073 149+285449 07:31:49.5 +28:54:48.7 2009 Jan, 28 th u 6x900 0.7 28.88 0.055 

V 6x360 0.8 27.83 
R 6x360 0.7 27.60 



termination. The difference in surface brightness normalised to 
the sky variance is A m /ct = -0.01 for J21 1444-005533 and 
A M /cr = -0.03 for J073 149+285449. Since these discrepancies 
are within a few percent of the sky variance in both cases, we con- 
clude that the intervening LLSs are effective in fully blocking the 
light from the background quasars. 

Table ^T|in the appendix provides photometric information 
for objects detected at S/N > 3. However, we will conservatively 
consider only targets with S/N > 5 in the u band to be candi- 
dates for the DLA counterparts. The photometry in the R, V, and 
/ filters is mainly intended to provide colors for the photometric 
redshift analysis, rather than an accurate determination of the total 
flux. To alleviate color gradient effects and seeing differences, we 
compute the half-light radius (rhi) for our targets on a white im- 
age, produced by stacking the R-, V-, and /-band images, where 
available. For all the galaxies detected in the u band, we compute 
colors in circular apertures, multiples of rui- Some candidates lie at 
very small impact parameters to the quasar sightline and their col- 
ors need to be corrected for quasar contamination. Therefore, we 
model and subtract the quasar light- profile by fitting a 4 th -order 
b-spline model (See Appendix A of iBolton et all2 006). An exam- 
ple of the residual image in the R band is presented in Figure [3] 
for J21 1444-005533. Although the result is quite satisfactory, the 
residuals may still affect the photometry and we choose to use only 
colors which are stable to the quasar subtraction. As already noted, 
the red-side images for the field of J073 149+285449 were affected 
by instrumental problems; since we are not able to model the scat- 
tered light, we do not present colors for this field. 



3.3 Impact parameter 

The impact parameter b is defined as the proper distance at the ab- 
sorber redshift between the line of sight to the quasar and the center 
of the absorbing galaxy, the latter computed as the first moment of 
the light distribution. Since the quasar is completely absorbed in 
the u band, we transfer the quasar position from the .R-band im- 
age to the it-band image, using accurate relative astrometry. This 
is done by first fitting an astrometric solution over the i?-band im- 
age, using stars with known positions. We then fit a second astro- 
metric solution to the it-band image, using more than five refer- 
ence objects, whose positions are extracted from the .R-band im- 
age and selected to be within ~ 20" from the quasar. Using this 
procedure, we achieve a high accuracy for the distances of objects 
close to the QSO sightline, better than that obtained from a single 
astrometric solution. The typical errors on the angular separation 
from this procedure are 0.05" for J21 1444-005533 and 0.07" for 
J073 149+285449 (corresponding to ~ 0.4 hf 2 J kpc at z = 3). Once 
the quasar position is known in the u band, we compute the pro- 



jected quasar-galaxy angular separation (b a3 ) for each candidate 
host galaxy. The angular distance is then converted into a phys- 
ical separation b p , assuming comoving distances of D c = 6102 
hfa 1 Mpc for J21 1444-005533 and D c = 5868 Mpc for 
J073 149+285449. The impact parameters obtained for the differ- 
ent candidates are listed in Table [3] The lowest impact parame- 
ter for candidates in the J073 149+285449 field is ~ 1.54" (i.e. 
1 1.89 h^ 1 kpc at z = 2.686, the DLA redshift), while that for sys- 
tems in the J21 1444-005533 field is - 2.86" (i.e. 21.61 h^ 1 kpc at 
z = 2.919). 



4 IDENTIFICATION OF THE DLA HOSTS 

The it-band images described in the previous section reveal a num- 
ber of candidates for the DLA host galaxy within an angular dis- 
tance of ~ 12" from the quasar. We wish to identify which, if any, 
of these is responsible for the observed damped absorption line. 
In studying absorption line systems, the impact parameter to the 
quasar sightline is often used to identify host candidates, following 
the general rule that the nearest object to the line of sight is likely to 
be the galaxy that causes the absorption. However, as will be shown 
later, the number of interlopers increases significantly in very deep 
searches; for this reason, a more quantitative treatment is needed. 
In this section, we discuss the two most straightforward methods 
to confirm that the DLA indeed arises in one of the candidate host 
galaxies. A more indirect, statistical approach to quantify the rela- 
tive probability that one of the candidates gives rise to the DLA is 
described and discussed in the next section. 

4.1 Spectroscopy 

The only way to confirm a galaxy-absorber association for each 
system is through a spectroscopic detection of the galaxy, with 
an emission/absorption redshift consistent with the redshift of the 
DLA. For targets close to or aligned with the quasar, detecting Lya 
emission in the DLA trough is a simple way to measure the red- 
shift of the host galaxy. The quasar light is blocked by the damped 
Lya absorption, and one can hence s earch for Lya em ission from 
the same redshift with impunity (e.g. lM0fler et al.ll2004l) . Previous 
searches for Lya emission were mostly limited by lack of knowl- 
edge of the location of the star-forming regions in the host galaxy, 
due to which it was not clear where to place (and how to orient) 
the slit for a spectroscopic search. This meant that a non-detection 
of Lya emission might arise simply because the brightest regions 
of the host galaxy were not covered by the chosen position and 
orientation of the slit. Crucially, our survey will directly yield the 
positions of the candidate host galaxies, allowing follow-up spec- 
troscopic studies to correctly position slits on all candidates. Note 
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Table 3. Projected angular and physical distances from the QSO sightline 
for each candidate host galaxy detected in the M-band images. Also quoted 
is the frequentist probability of the candidate's being an interloper. 
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that the detection of other spectral lines at optical wavelengths is 
likely to be affected by the bright quasar continuum. However, the 
Ha transition, redshifted into the near-IR waveband for DLAs at 
z > 2, is the other plausible transition by which the galaxy redshift 
can be measured. 

Beside Lya (or Ha) emission, the technique adopted here al- 
lows an alternative route to confirm or at least constrain the galaxy 
redshift. Figure [T] shows that the QSO contamination disappears 
blueward of the Lyman break of the higher- redshift LLS; any con- 
tinuum detected in this part of spectrum comes only from fore- 
ground objects that are in the slit. Therefore, we can establish the 
redshift of the candidate by identifying other absorption features 
such as metal lines. Furthermore, a less precise but still useful red- 
shift determination can be obtained by searching for a signature of 
the galaxy Lyman limit, if visible redward to the atmospheric cut- 
off. 

A limitation of the above spectroscopic methods is that dust 
extinction can suppress the Lya or FUV emission. Also, our poor 
knowledge on the escape fraction of Lya photons makes it very dif- 
ficult to estimate the expected Lya fl ux at a given UV luminosity 
(see for example Matsuda et alj2004h . In addition, the detection of 
the galaxy continuum in spectra can only be obtained within a rea- 
sonable integration time (~ 1 — 2 hours at a lOm-class telescope) 
for targets brighter than 25 or 26 magnitudes. As discussed in the 
introduction, simulations (and some observational studies in the lit- 
erature) suggest that DLAs may be associated with even fainter ob- 
jects. Finally, the Ha line is only observable from ground-based fa- 
cilities from a narrow redshift range. For these reasons, it may not 
be possible to spectroscopically confirm all candidate host galaxies 
and different approaches are required. 



4.2 Photometric redshift 

A second method to determine the redshifts of the candidate host 
galaxies is via a photometric redshift ("photo-z") estimate. The ad- 



vantage of this technique over spectroscopy is that it can also be 
used for faint galaxies. Unfortunately, there are two main issues 
that affect the photo-z analysis. First, quasar contamination does 
not allow a robust color estimate for targets at low impact param- 
eter, i.e. those more likely to be associated with the DLA (see be- 
low). For this reason, the photo-z method can only be used to es- 
timate the redshift for targets at large projected distances from the 
quasar. Second, photometry in 4 optical filters covers only a nar- 
row range of the blue part of the spectral energy distribution (SED) 
of a galaxy. This significantly increas es the number of catastrophic 
outliers (e.g. Hildebrandt et alll2008h . making the results less reli- 
able. It is worth mentioning that one can try to constrain the redshift 
through photometry by combining our ground-based u-band imag- 
ing with HST UV observations in narrow- or medium-band filters. 

As noted earlier, instrumental problems caused colors for the 
galaxies in the field of J073 149+285449 to be contaminated by 
scattered light from the quasar, making these colors unreliable for 
a photo-z analysis. For the J21 1444-005533 field, we have com- 
puted photometr ic redshifts for the c andidate host galaxies using 
the e azy code terammer et alj2 008) and an SED template library 
from lGrazian et aL 1 420061) . The fit was performed on a grid of red- 
shifts ranging from 0.01 to 4 with a resolution Az — 0.01, includ- 
ing the effects of IGM absorption. Although the code allows linear 
combinations of SEDs, we used individual templates, without pri- 
ors on the galaxy magnitude. We initially focused on targets A and 
C in the J21 1444-005533 field since these are the two galaxies with 
lowest impact parameters to the quasar line of sight. For J21 1444- 
005533-C, the best-fit redshift is z = 2.65. The SED, the syn- 
thetic fluxes (red square), and the observed fluxes (blue triangles) 
are shown in Figure[4] with the \ 2 distribution as a function of red- 
shift displayed in the inset. No other significant relative minima are 
found besides the two in the same redshift interval. Conversely, for 
target A, the best fit is at z — 2.50, but a second minimum is found 
at z ~ 0.2, making this redshift determination less secure. Unfor- 
tunately, the limited number of available filters means that neither 
redshift can be constrained at a high confidence level (C.L.). In fact, 
although J211444-005533-A and J21 1444-005533-C appear to be 
located at lower redshifts than the DLA {z aos = 2.919), we can- 
not rule out the galaxy-absorber correspondence at > 3<r C.L. for 
either candidate; this illustrates the problems with the photo-z ap- 
proach, and emphasizes the need for spectroscopic confirmation. 
Among the other galaxies with color determinations in the field of 
J21 1444-005533, we do not find any significant DLA candidates 
using the photo-z approach. 



5 STATISTICAL APPROACHES 

As discussed, spectroscopy is required to securely identify the host 
galaxies of DLAs. However, besides being an expensive observa- 
tional task, it may even be unsuccessful in some cases. Statistical 
approaches to quantify the probability that a given galaxy is associ- 
ated with a DLA are therefore valuable. Similar to the identification 
of optical counterparts for radio and X-ray sources, we would like 
to estimate the probability that a given galaxy is associated with a 
DLA, given some observables (e.g. the impact parameter, the Hi 
column density, etc). We will use two different treatments for this 
purpose: (i) a frequentist approach, used to test whether a candi- 
date is an interloper, and (ii) a Bayesian estimator used to assign 
a probability that a candidate is actually associated with the DLA. 
Considered jointly, they can help to decide which galaxy (if any) is 
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Figure 4. Photometric redshift for galaxy J211444-005533-C: SED, syn- 
thetic fluxes (red square) and observed fluxes (blue triangles with errors). 
The inset shows the \ 2 distribution as a function of galaxy redshift. The 
fitted redshift (z = 2.65) suggests that this galaxy is not associated with 
the DLA at z^ia = 2.92, but we cannot rule out the galaxy-absorber corre- 
spondence at > 3(T C.L. 
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Figure 5. The probability Pt of detecting at least one interloper as a func- 
tion of the impact parameter b as , from Eq. (4)- Different lines are for dif- 
ferent cuts in u magnitude. In very deep surveys, there is a significant like- 
lihood of detecting an interloper for b as > 2". 



the DLA host, without the limitations imposed by color determina- 
tions or galaxy brightness. 

We stress that we do not aim to provide a secure galaxy iden- 
tification by this approach. Nevertheless, this technique is useful 
to pre-select the best candidates for spectroscopic follow-up. Also, 
when the present and future searches will yield a significant number 
of spectroscopically-confirmed galaxies, one can refine the statis- 
tical methods introduced here to select a bona fide DLA sample, 
useful to study the properties of the DLA population rather than 
those for individual detections. 

5.1 Frequentist approach 

The frequentist method is based on Poisson statistics applied to 
number counts of the surface density of galaxies. For each can- 
didate with an impact parameter b and apparent magnitude m, we 
can compute the probability of detecting one interloper in the pa- 
rameter space (< 6,< m). Low values for this probability indicate 
that the candidate is unlikely to be an unrelated object, suggesting 
that it is likely to be the DLA host or the host of a second absorber 
at lower redshift along the sightline. 

Given the surface number density of objects brighter than a 
fixed magnitude n — m), the mean number of interlopers 

expected for r ^ b is 

p = nb 2 n (3) 

and the probability to detect at least one galaxy is foownes et al.l 
fl986l) 

P f = 1 - e _p . (4) 

If Pf <C 1, it is unlikely that the candidate corresponds to an 
interloper. However, as widely discussed in th e literature (e.g. 
iDownes et al]|l986l : [Sutherland & Saundersll 19921) . the probability 
of the candidate being the right identification does not follow im- 
mediately as 1 — Pf. In fact, when multiple candidates lie within 
the search radius, the probability for each object is computed inde- 
pendently, leading to the ill-defined case in which the total prob- 
ability for all candidates is not unity. The correct probability of a 



galaxy-absorber association comes from a Bayesian treatment (see 
next section). 

For n, we use galaxy num ber counts derived bylGrazian et al.l 

J2009h (their Table 1; see also iRafelski et alj|2009t) from (7-band 
imaging in a wide sky region (~ 0.4 sq. deg.), down to U = 27.86 
AB mag.; this limit matches the depth of our survey. In Figure[5] we 
plot the dependence of Pf on the impact parameter, derived from 
Eq. Q for different magnitude cuts. This analysis outlines how two 
competing effects play a role: depth and confusion. In fact, deep 
imaging is desirable to increase the chance of detecting absorber 
counterparts, but at the same time the number density of interlopers 
increases steeply, introducing significant confusion even at low im- 
pact parameters. Specifically, deep surveys, to u magnitudes fainter 
than ~ 27 mag., have a significant probability (> 20%) of finding 
interlopers at impact parameters b as > 2". This result stresses the 
need for a quantitative treatment to identify DLAs: simply assum- 
ing the closest candidate to the quasar sightline to be the counter- 
part can lead to false detections. Note, however, that the figure also 
shows that this assumption is likely to be a good one for b as < 1" 
(corresponding to a physical distance of < 8.2 kpc at z — 2.5). 

Table [3] lists the frequentist probability Pf for all candidates 
in the fields of J2 11444-005533 and J073 149+285449. In the lat- 
ter field, candidate-A has a low probability of being an interloper 
(Pf — 0.06), making it the most likely candidate for the DLA 
host. Conversely, for J21 1444-005533, the minimum probability 
(Pf = 0.08) is for candidate-F, a bright galaxy at z ~ 0.3 (as 
measured in the SDSS and confirmed by our photometric redshift 
analysis), while the next lowest probability of being an interloper 
(Pf = 0.19, for candidate-A) is non-negligible. This case sug- 
gests that the frequentist analysis needs to be complemented with 
additional priors on the impact parameter, in the absence of spec- 
troscopic information. In fact, as we will show in the next section, 
it is quite unlikely that luminous DLA galaxies lie at very large 
distances (~ 60 h^ 1 kpc for J21 1444-005533-F) from the quasar 
sightline. 
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Table 4. Models for the impact parameter distribution from Eq. 1 1 7i . OBO 
is from Hi maps of local galaxies and SB3 from SPH simulations at z = 3. 
OB2, OB2.5 and OB3 assume redshift evolution of local Hi disks for z = 
3, 2.5 and 2. 
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0.37 


0.12 


1.29 


OB3 
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0.37 


0.14 
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5.2 Bayesian approach 

Although independent of any priors, the frequentist approach does 
not yield a relative probability that an identification is correct, 
nor does it take into account the fact that several candidates can 
be considered for a single DLA. Both of these can be achieved 
by a Bayesian treatment. The present section is organised as fol- 
lows: after a review of the basic Bayesian formalism, we pro- 
pose a method that can be applied to identify the host galaxies of 
generic absorption line systems (ALSs), such as Mgn absorbers, 
LLSs or DLAs. We then derive specific priors on the impact pa- 
rameters of DLAs based on theory and indirect observational con- 
straints. In the end, after testing this procedure on a sample of 
six spectroscopically-confirmed DLA hosts, we apply the method 
to compute probabilities for our targets in the fields of J2 11444- 
005533 and J073 149+285449. 



5.2.1 Formalism 

Different approaches based on Bayes' theorem have been devel- 
oped to identify the optical counterparts of X-ray or radio sources 
and, more recently, sub-mm sources. Due to the large number of 
works focused on this topic, varying terminology has been intro- 
duced over the past years. To make explicit our choice, we will 
review th e fundamental concept s at the base of this method, mostly 
following lRutledge et al J i2OO0h . Further, we also optimize the pro- 
cedure for the issue addressed in this paper, namely the identifica- 
tion of the galaxy counterparts of high-z absorbers. 

For a group of M candidates, the likelihood ratio LR is de- 
fined as the product of the normalised probability distribution func- 
tions (PDFs) of some properties x a i s of the ALSs to those of ran- 
dom foregrouncjfl galaxies. The useful physical quantities are var- 
ious observable parameters, including magnitude, impact parame- 
ter, H I column density, metal line equivalent widths, and kinemat- 
ics. However, while the inclusion of many properties enables a nar- 
rower distribution of the likelihood ratio which restricts the number 
of false detections, this method is sensitive to the functional form 
adopted for x a i s . To avoid subtle biases, it is hence better to restrict 
the number of priors to only well-known quantities. Here we con- 
sider a simple case in which only priors on the impact parameter 
f(b) and magnitu de distribution q(m) are ass umed. 

Following ISutherland & Saunders! fl992h . we define LR as 



6 Due to the design of our experiment, objects at redshifts higher than the 
LLS one cannot be detected in the proximity of the quasar. Therefore we 
define interlopers as foreground galaxies, even though this is not entirely 
appropriate for objects at redshifts between those of the DLA and the LLS. 



Q(m)f(b) 
n(m) 



(5) 



LR is the ratio of the probability p of detecting a real counterpart 
at an impact parameter b and magnitude m 

p = q(m)f(b)2irb db dm (6) 

to the probability of detecting a random foreground object 

p — n(m) 2-Kb db dm , (7) 

where n(m) gives the distribution of galaxy number counts per unit 
area. This last quantity is not related to the nature of any particular 
ALS, and it can be derived empirically from deep imaging; as wit h 
the frequentist approach, we use the result of lGrazian et alj j2009h . 

According to Bayes' theorem, the reliability R of a correct 
identification is 



Rals(LR) = 



P(true, LR) 



P(true, LR) + P (false, LR) 



(8) 



which is the ratio of the probability of true associations to the 
sum of true and false associations. R a i s expresses the probabil- 
ity that a candidate with a given LR is the correct identifica- 
tion and not an unrela t ed f oreground object. As pointed out by 
ISutherland & Saunders! ( fl992h . equation l[8) does not account for 
the fact that multiple candidates can be considered for a single ab- 
sorber. In other words, a high value of R indicates that the con- 
sidered candidate is an unusual source compared to the foreground 
galaxies, but frequently high reliability is assigned to more than one 
object. Eq. ([8) provides no insight to solve this ambiguity. 

To add this missing information, we introduce two other statis- 
tics. The first one is the probability P no ,id that none of the M pos- 
sible candidates is associated with the ALS: 



p 



n^i-Pj 



(9) 



The second is the probability P a i s ,i that the i-th source is uniquely 
associated with the ALS: 



s 



(10) 



In the previous two equations, S is a normalization factor that en- 



sures that P noM + £ j=1 P a u,i 

M 

S = ^2R i Il& i (l-R j 



Ri 



(11) 



In the end, Eq. d 1 Ofc is the quantity that will be used to identify 
likely galaxy-absorber associations. 

Before we apply this procedure to the case of DLAs, we high- 
light a possible problem that can affect the computation of LR 
with Eq. ((5). For ALS studies, the form of the prior q(m) has to 
be chosen carefully. Properties of ALSs in emission are currently 
poorly constrained and very little or nothing can be inferred about 
q(m) from observations. Simulations can only partially help, espe- 
cially because the star formation rate and stellar emission here are 
mostly computed based on semi-empirical prescriptions; this im- 
plies that any priors derived from simulations may not be reliable. 
Conversely, the use of the observed luminosity functions of high- 
redshift galaxies might imply a strong a priori constraint on the 
nature of the ALS counterparts. Furthermore, although with signif- 
icant noise, q(m) can be obtained in a statistical sense by subtract- 
ing the magnitude distribution of galaxies in fields without DLAs 
from that in fields with known absorbers. This procedure requires a 
significant number of fields for convergence, but these observations 
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Figure 6. Left: Probability distribution function f a i, a (b) in one realisation of the quasar experiment (dashed histogram) and for the model OB0 (solid line). 
Right: Cumulative distributions f f, 3 (> b) (dashed line) and f b s {< b) (solid line). According to this prior, ~ 50% of DLAs at z = 3 are expected to lie 
within 1" from the quasar, with the maximum probability located around 0.5". 



are currently unavailable. Without a reliable estimate for q(m), we 
suggest reducing the number of priors in the likelihood ratio, rather 
than adopting an inappropriate choice that might introduce uncon- 
trolled biases. Note that one of the goals of our survey is to charac- 
terise the star formation properties of DLAs; incorrect information 
on the magnitude prior might have significant implications for the 
final result. 

To remove q(m ) from the likelihood ratio, following 
ISutherland & Saunders! dl992h . we modify the definition of the 
likelihood ratio by marginalising Eq. $5$ over rn. We define LR a i s 
as 

Q(m)f(b) 



LRals — 



where 



Q(m) 



M(mi) 



q(m)dm 



and 

M(mi) = / n(m)dm 



(12) 



(13) 



(14) 



are the priors q(m) and n(m), integrated up to the limiting mag- 
nitude mi. Both Q and M are constants; the fact that q(m) is un- 
known implies that the likelihood ratio has now an unspecified nor- 
malisation. Therefore, we adopt an operational definition of Eq. {8} 
as the probabilit y of not obta ining R a is,i randomly for the i-th 
candidate dGilmour et alj|2007l) . The idea behind this procedure is 
to compute a distribution for LR using several sets of interlop- 
ers (Ni n t). High reliability is assigned to candidates whose LR 
exceeds typical values found among interlopers. Formally, this is 
granted by 

_ 1 N(LR > LR aU ,i) 



Ra 



Ni„t 



(15) 



where N(LR > LR a i Sti ) is the number of interlopers with a like- 
lihood ratio that exceeds LR a i Sti . N int should be large enough 
to guarantee the convergence of R a i Sl i, Because the condition 
LR ^ LR a i 3t i in Eq. d 1 5 b is satisfied modulo an arbitrary posi- 
tive constant, the final reliability is independent of Q and N. Since 
the likelihood ratio distribution is computed directly from the imag- 
ing (see Sect. l5.2,3l l, this procedure offers the additional advantage 



of treating the limiting magnitudes mi self consistently. The down- 
side of this empirical approach is that we lose knowledge on n(m), 
a well-defined quantity. However, we complement the Bayesian 
treatment with the frequentist approach, which includes the number 
count statistics. 



5.2.2 Impact parameter modelling 

Considering the specific case of DLAs, the only unspecified quan- 
tity at this point is the prior on the impact parameter f(b). This 
can be derived from observations if a sample of spectroscopically- 
confirmed objects is available. Unfortunately, the hosts of only six 
DLAs at 2 > 2 have so far been confirmed with spectroscopy, 
implying that it is not currently possible to use the observed im- 
pact parameters to constrain f(b). We will hence indirectly derive 
a prior on b; the downside is that the final derived probabilities will 
carry additional uncertainty. In the future, an updated form of f(b) 
derived directly from observations can provide a more reliable prior 
for statistical analysis. 

Here we introduce and compare two different priors. The first 
one is based on the ACDM cosmology framework, in which galax- 
ies assemble through a series of minor and major mergers. During 
this process, gas is thought to be distributed in clumps and filaments 
which do not necessarily resemble low-redshift disks. In addition, 
gas in individual halos can cool to form a disk whose size follows 
the size evolution of the dark matter halo. For this reason, we re- 
fer to this prior as "evolutionary". Co nversely, our second prior i s 
based on the observational results of Prochaska & Wolfe (2009), 
who used a large DLA sample (~ 1000 DLAs) to find that the 
shape of the frequency distribution of projected H I column densi- 
ties f(Nni, X) does not evolve significantly with time at z > 2 
and also matches the one at z ~ 0. This implies that the convolu- 
tion of the projected Hi surface density distribution in individual 
DLAs, their sizes and number density i s preserved over ~ 1 Gyr . 
A stronger interpretation presented by IProchaska & W olfe ( 2009 ) 
is that galaxies have Hi disks whose distribution matches 

that of present-day spirals, a result that in turn suggests how H I in 
individual galaxies could be not especially sensitive to the under- 
lying dark matter distribution. Therefore, we refer to this second 
prior as "non-evolutionary". Further investigations are required to 
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Figure 7. Left: The probability distribution function f s i m (b) from an SPH simulation at z = 3 (dashed histogram) and for the fitted model SB3 (solid line). 
Right: Cumulative distributions f s im{> b) (dashed line) and / s ; m (< b) (solid line). Also shown, cumulative distributions derived with a toy model for z = 3 
(OB3), z = 2.5 (OB2.5), and z = 2 (OB2) (blue dotted, green long-dashed, and red dash-dotted lines). According to this prior, ~ 50% of DLAs at z = 3 are 
expected to lie within 0.5" from the quasar with the maximum probability located around 0.3". 



confirm or disprove this hypothesis, but for now we note that the 
non-evolutionary prior is also useful to account for more extended 
H I than the one found inside simulated disks at high redshift. 

We construct the non-evolutionary prior f bs(b) by simulat- 
ing the quasar experiment, using sightlines through H l-21cm maps 
of local gala xies to reproduce the DLAs seen against background 
quasars (c.f. IZwaan et alj|2005ah . For this purpose, we u se Hl- 
21cm maps from the THINGS survey (Walte r etal.ll2008l) which 
includes 22 spirals and 12 Sm/dlrr galaxies at a resolution of ~ 7". 
For each galaxy, we measure the local H I column density by av- 
eraging the signal within a resolution element in ~ 200 random 
positions. We re-project the observed Nh i to a variety of inclina- 
tions, thus accounting for the fact that face-on disks are more likely 
to be selected in absorption than edge-on ones. For H I column den- 
sities above the DLA limit, we then compute f b s (£>) by combining 
all the different sightlines for each galaxy. To reproduce a popu- 
lation of galaxies, we weight each o bject in the THINGS sample 
with the H I mass function (HIMF) 9 fewaan et all 2005b) and the 
sky covering factor A. This is computed assum ing that the H I ra- 
dius scales with the H I mass <Verheiierj|200lh . We also include a 
correction factor proportional to the number of galaxies (N ga i) in 
a given mass range (that defines Nbin bins) to compensate for the 
fact that dwarf galaxies are undersampled in the THINGS survey 
with respect to spirals. Combining all of these elements, we derive 
fobs(b) with 

ESl" h (Efe M Q(M k )A(M k )f k (bj) 

/o6s(6) = — n — r7 n ,. w 

ESf h &(M k )A(M k )) 

It is worth mentioning that high-redshift DLAs do not probe exclu- 
sively sight-lines similar to the ones through local disks as seen in 
21cm. In fact, highly ionized species (e.g. N v) most likely associ- 
ated with the halo are sometimes observed (Fox et al. 2009). How- 
ever, the use of local 21cm maps seems an appropriate analogy to 
model H i-rich galaxies at high redshift. 

In the left panel of Figure [6] we show fobs (6) (dashed his- 
togram) from one realisation of the above equation dl6t . To model 
this distribution, we fit a function of the form 

f(b) = Ab a exp(-Bb fl ) . (17) 



This analytic formula is designed to reproduce f(b) for local galax- 
ies: the power law accounts for the increasing probability of inter- 
secting a disk at larger radii, while the exponential term accounts 
for the radial decay of the H I surface density profiles. The solid line 
shows the fit computed over 50 such experiments; the derived pa- 
rameters and statistical uncertainties for this non-evolution model 
(OB0) are quoted in Table|4] It is reassuring that, although we are 
using a smaller s ample, f b s (b) resem bles qualitatively the distri- 
bution derived bv lZwaan et al.1 d2005aj) . In the right panel of Figure 
[6] we plot the cumulative distributions fobs(> b) (dashed line) and 
fobs(< b) (solid line), obtained from the OB0 model. From this 
analysis, we infer that ~ 50% of DLAs at z = 3 are expected 
within 1" from the quasar with the maximum probability located 
around 0.5". Since DLAs are expected with low probability at im- 
pact parameters ^ 40 h^ 1 kpc (~ 5" at z = 3), our search radius 
of 12" seems large enough to guarantee sufficient sky coverage 
during our candidate selection. 

Turing our attention to the evolutionary prior, we derive 
/sim(fe), using a cosmologi cally-weighted samp le of DLAs drawn 
from the SPH simulation of Pontzen et al. (2009). This is similar to 
the simulation presen ted in Go vernato et al.1 d2007t) and analysed in 
IPontzen et al.1 d2008l) . but at higher resolution. According to these 
authors, the impact parameter is defined as the projected distance 
to the minimum of the dark-matter halo potential. This is not an ob- 
servable quantity, but it is reasonable to assume that high star for- 
mation occurs when the gas funnels towards the center of the halo, 
so that this definition of the impact parameter does not yield differ- 
ent results from our observational one. It is useful to note that by 
selecting individual halos, we are considering the gas distribution 
inside individual galaxies, with no distinction between the central 
galaxy and satellites (see Sect. 15.2.3V 

The dashed histogram in the left panel of Figure [7] shows 
fsim(b) from a realisation of DLAs at z — 3 from the SPH simula- 
tions of IPontzen et all J2009I) . We model f s im{b) (solid line) using 
the fitting formula in Eq. dl7| l; although designed for local galax- 
ies, this function seems flexible enough to describe also the shape 
of fsim (b) in high-redshift ACDM simulations. The only discrep- 
ancy with the data arises at high b. The fitted parameters for this 
evolutionary model (SB3) are quoted in Table|4] In the right panel 
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Figure 8. The results from 200 trials of the Bayesian procedure, with the probability assigned with the evolutionary prior to 6 known DLAs from the literature. 
The probability Pdi a ,i assigned to the correct DLA is shown with a solid line, and that for the interloper with the highest reliability with a red dashed line. 
The blue dotted line indicates the highest probability assigned to foreground galaxies in a control test with no DLAs in the field. In all but one case (the 
SLLS towards 2233.9+1318), the Bayesian method assigns the highest probability to the correct galaxy-DLA association. The vertical dotted lines indicate 
Pdla i = 0.8 and Pdia,i = 0-6. From this analysis, we infer that, in an ideal experiment, we expect to detect 60 bona fide counterparts out of 100 fields which 
host detectable DLAs when we assume the criterion Pdia,i > 0.8, while 15 interlopers will be incorrectly classified as DLAs. 



of Figure[7] we plot the cumulative distributions f S im(> b) (dashed 
line) and f S im(< b) (solid line), as obtained directly from the data. 
From these simulations, we deduce that a DLA can be found with 
~ 50% probability within an impact parameter of ~ 0.5" at z = 3. 
The maximum probability is located around 0.3", roughly a factor 
of 2 less than that predicted in a non-evolutionary model. These 
low values stress the advantage of our drop-out technique in obser- 
vations limited by seeing. From a comparison of f s jm (< b) with 
the rate of incidence derived by Nagamine et al. (2007), we note a 
similarity with their no- wind run, although our distribution exhibits 
a narrower tail. 

The major difference between models OB0 and SB3 resides 
in the fact that gas follows the dark-matter potential more closely 
in simulations than in the non-evolving model, which assumes that 
the gas distribution does not change with redshift. This is reflected 
in a distribution for OB0 that is broader and peaks at higher impact 
parameters than the one for SB3. To extrapolate the simulation re- 
sults to z < 3, we scale the Hi distribution observed at z = 
following the size evolution of dark matter halos as a function of 
redshift. Starting with Hl-21cm maps of local galaxies, we repeat 
the quasar experiment as for OB0, but this time accounting for a 



redshift dependence of b. In this toy model, we keep the observed 
surface density distribution constant, assuming that the total Hi 
mass in the halo increases due to gas accretion onto the disk. In 
the literature, several scaling relations for the galaxy radius as a 
function of the redshift can be found, from both theoretical argu- 
ments and observations of high-z galaxies. In our model, we adopt 
r(z) oc H(z)~ 2/3 oc (1 + z)' 1 iBouwens et alj|2004l) for z > 1. 
This is in agreement with Papov ich et al.l d2005l) who show that the 
size distribution of galaxies at z < 1 is broadly consistent with that 
observed in the local Universe. This particular choice enables us to 
reproduce almost perfectly the SM3 model, starting from H 1-2 lcm 
maps of z — galaxies. This agreement is shown in the right panel 
of Figure [7] where the extrapolated model (OB3) is shown with a 
blue dotted line. The only significant discrepancy arises for b > 7 
kpc The figure also includes the cumulative distribution f s i m 
at z = 2.5 (OB2.5) and z = 2 (OB2) (green long-dashed and 
red dash-dotted lines, respectively). The parameters of the analytic 
expression (T7J are listed in Table|4] 

Being able to match the simulations with an ad hoc r(z) may 
not seem an interestin g result. However, other scaling relations (e.g. 
iFerguson et alj|2004l) are equally able to reproduce a distribution 
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at least consistent with the simulations. We speculate that there 
might be a more profound reason for this agreement: gas clumps re- 
assemble in growing dark matter halos without a drastic change in 
the radial distribution of the Hi column density since z — 3 or even 
beyond. Further investigations on the gas distribution in SPH simu- 
lations are desirable to investigate this hypothesis. While account- 
ing for disk evolution, we have assumed that the weighting proce- 
dure defined in Eq. d!6t does not change as a function of redshift. 
Note that the impact parameter distribution is a normalised quan- 
tity, and is hence not much affected by any mass-independent vari- 
ation in the number density of H I galaxies or in the covering factor 
A. Conversely, a change in the slope of the HIMF may alter the rel- 
ative contribution of massive and dwarf galaxies, altering f(b). No 
direct determinations of the HIMF as a function of redshift are cur- 
rently available, and we hence k eep the slope constant, consisten t 
with the semi-analytic model of lObreschkow & Rawlingsl ( 2009) 
for z < 3 (see their figure 1). 

5.2.3 Implementation, procedure test and discussion 

Once the priors on the impact parameter are known, we test 
this Bayesian procedure using a sample of six spectroscopically- 
confirmed high-redshift DLAs (see Appendix IB!. Although hetero- 
geneous, this sample provides the only present observational test. 
To evaluate the reliability with Eq. U5\ , we need a realisation of 
LR for foreground objects. For this purpose, we compute LR for 
all the galaxies detected within r aearc h < 10" of a random position 
in a field where no absorbers are known. We repeat this procedure 
for several random positions to guarantee the convergence of LR. 
Since we restrict to a searching area r < 10", we implicitly im- 
pose the condition LRdi a ,i = for b > 10". This well-defined 
boundary prevents probability from flowing towards high impact 
parameters. Note that, if r sear ch is allowed to increase to arbitrar- 
ily large radii, the number of interlopers at large impact parameters 
with small but non-zero LR will increase accordingly. Therefore, 
a non-zero reliability will be assigned also to DLA candidates with 
a low likelihood ratio, effectively decreasing Pdi a ,i f° r the most 
likely candidates. This issue is bypassed by limiting the search ra- 
dius to r < 10". 

After this, we extract all the sources detected within 10" from 
a random position in a field not hosting any known DLAs. Then, we 
add to this list of interlopers a known DLA at its measured impact 
parameter; finally, we compute Pdia,i an d P no ,id for all of these 
candidates using both the evolutionary (SB3, OB2.5 and OB2) and 
the non-evolutionary (OB0) priors. We repeat this test 200 times 
for each confirmed DLA. To estimate the number of interlopers 
that are incorrectly identified as DLAs, we also run a control test in 
which only foreground sources are included. The results are in Fig- 
ures [8] and [9] where we compare results for the evolutionary and 
non-evolutionary priors, respectively. For each known DLA, the 
probability Pdi a ,i assigned to the correct galaxy counterpart is in- 
dicated by a solid line, while that assigned to the interloper with the 
highest reliability is shown with a red dashed line. Finally, we dis- 
play the results of the control test in fields without DLAs: the blue 
dotted line represents the probability Pdia,i assigned to the fore- 
ground galaxy with highest reliability when only interlopers have 
been detected. 

Several pieces of useful information can be derived from the 
plotted distributions. First, looking at the six panels in Figures [8] 
and|9] it is evident that in all but one case our procedure assigns the 
highest probability to the correct candidate DLA host. We therefore 
conclude that the Bayesian method is successful in finding the right 
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Figure 10. The probability of non detection (P no id) from 200 trials of the 
Bayesian procedure, on the left (in red) for the non-evolutionary prior and 
on the right (in blue) for the evolutionary prior. P n0t id for the control test 
(C.T.; without known DLAs in the field) is highly dispersed, with a low 
mean value. In deep images, the total probability among the candidates is 
high due to the number density of interlopers and P no> id drops accordingly. 
The inset displays a zoom-in on the confirmed DLAs. 

galaxy-absorber association. The only evident failure is for the tar- 
get 2233.9+1318, an SLLS with N H i= 20.0 cm -2 . As shown by 
IZwaan et al.l d2005al) (see also Figure [TTt. the impact parameter is 
a decreasing function of the H I column density; using a prior de- 
rived for absorbers with Nhi^ 2 x 10 20 cm -2 may hence under- 
estimate the quasar-galaxy separation for absorbers with lower H I 
column densities by more than a factor of 2. Second, it appears that 
the evolutionary and the non-evolutionary priors reproduce similar 
values of probability. In fact, the relevant feature that distinguishes 
the two models is the location of the maximum. From the definition 
of Eq. H5\ , similar values of Pdi a ,i are expected with both priors 
when candidates lie on the tail of the f(b) distribution. 

It is also useful to note that the probability associated with the 
interlopers with highest R (red dashed line) peaks at low Pdi a ,i- 
This shows that Eq. l llOt distinguishes between targets that share 
a high reliability. Finally, considering P no ,%d (Figure ITOt. we no- 
tice that this statistic is not particularly useful in deep imaging. In 
fact, although it behaves as expected for the fields with a known ab- 
sorbers, for our control test in which only interlopers are included, 
Pno,id exhibits a highly dispersed distribution with a mean value 
around 0.1. This is not exceedingly higher, although a factor of 10 
larger, than the mean values ~ 0.01 — 0.03 derived from the other 
six experiments where DLAs are present. This is mostly due to the 
fact that very deep fields have a high number density of interlop- 
ers. When the number of candidates is large, even if no DLAs are 
detected in the field, 2~2 Pdi a ,% can increase enough to make P n0t id 
drop accordingly (since P no ,id + J2i=i PaU,i = 1)- 

The fact that P no ,id is not a useful indicator makes our analy- 
sis slightly more complicated. Our test shows that whenever a DLA 
is detectable in the field, the Bayesian procedure is able to cor- 
rectly identify it (as shown by the comparison between solid black 
and dashed red histograms). However, if the DLA is too faint to be 
detected, interlopers may be incorrectly identified as the absorber, 
without any warnings from P no ,id- We quantify the number of spu- 
rious identifications by using our control test. For this purpose, we 
use the frequency with which high probability is assigned to in- 
terlopers in fields without DLAs. This provides an estimate of the 
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Figure 9. The same as Figure [8] but for the non-evolutionary prior. 



contamination rate in our survey. Since the control test assumes that 
no DLAs are in the fields, this rate is somewhat overly-pessimistic. 
Finally, we note that our control test is not formally included in the 
Bayesian procedure, and the contamination rate we assume does 
not contribute to the probability P UO: id + YLfLi Pals,i = 1- There- 
fore, for a given probability limit Pu m on Pdi a ,i, the frequency 
with which a DLA is correctly identified (i.e., the number of trials 
for which Pdia.i > Plim in the solid histogram) and the frequency 
with which an interloper is incorrectly identified as the absorber 
(i.e., the number of trials for which Pdia.i > Pum in the dotted 
histogram) do not add up to one. 

Nevertheless, these rates provide two extreme cases, useful to 
estimate the completeness and contamination in a bona-fide DLA 
sample derived with statistics. Our tests indicate that for galaxy- 
absorber associations with Bayesian probability Pdia.i > 0.8 the 
DLA galaxy is correctly identified ~ 60% of the time, whereas 
interlopers exceed Pdia.i > 0.8 only ~ 15% of the time. These 
rates have been computed excluding the SLLS towards quasar 
2233.9+1318, not representative of the DLA population^- This 
means that such criteria should correctly identify 60 counterparts 



Due to the limited sample available, one might be concerned that these 
values are driven by the results for DLA PKS0458-02. However, even if we 
do not include this object, we find that still 55% of the galaxies are correctly 
identified, showing that the mean is not strongly dominated by this system. 



out of 100 fields with detectable DLAs. Conversely, in 100 fields 
that do not show a DLA galaxy, these criteria would result in 15 in- 
terlopers being incorrectly classified as DLAs. If we weaken the 
probability limit down to Pdia.j > 0.6, our tests show that the 
5 DLA fields have Pdia,i > 0.6 on average 85% of the time 
whereas interlopers exceed Pdi a ,i > 0.6 typically 35% of the time. 

Turning to the discussion, we should emphasis that this statis- 
tical method is based on a set of assumptions that may not hold for 
all the sight-lines under consideration. We wish to discuss some of 
them in more details. First, the fact that we are considering a single 
galaxy-absorber association at a time can pose a limitation when a 
group of gala xies is located at the ab sorber redshift (e.g. towards 
Q2206- 1958; IWeatherlev et al.l2005l) . In fact, our analysis will fa- 
vor only one object and rejec t the other as interlopers. Con versely, 
clustering around the quasar jHennawi & Prochaska 2007) and the 
QSO host galaxy itself do not affect our analysis since UV light 
from these galaxies is absorbed by the intervening LLS, as long as 
the systems are covered in projection by this absorber. 

In addition, we cannot rule out with this statistical method that 
the detected objects are not associated with other intervening ab- 
sorbers at z < ziu. Indeed, towards QSO J21 1444-005533 we de- 
tect two Mg n systems at 2 = 2.02 and 2 = 1.84 and a C IV system 
at 2 = 3.14. Similarly, there are two Mgn systems at 2 = 1.80 and 
2 = 1.88, towards J073 149+285449. This source of confusion is 
partially alleviated by the fact that the priors on the impact param- 
eters for Mgn peak at larger values. In fact, both observations and 
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simulations (e.g. iKacprzak et al .11 20091 ; loien et alfeOldh show that 
Mgll are frequently (but not uniquely) found at b > 20 kpc. 

As for the choice of the non-evolutionary prior, simula- 
tions showjhatjr^siyehalos can host multiple gas-rich satellites 
(e.g. lCeverino et aDl2009h . In magnitude-limited surveys, only the 
brightest systems (central galaxies) will be detected, but also the 
satellites can give rise to an absorption line. Therefore, the most 
valuable quantity to set the priors may be the distance from each gas 
clump to the brightest star-forming center in the halo. In this con- 
figuration, a prior will exhibit a more extended tail towards larger 
impact parameters than the one here presented. Future works will 
address this issue. For now, we caution that we will probe only 
those DLAs that originate within the brightest central star-forming 
centers. Indeed, the inclusion of larger impact parameters in this 
statistical procedure is not a trivial task: the number of foreground 
sources is a steeply increasing function of the distance from the 
quasar and the high degree of confusion is not optimal to identify 
this particular class of DLAs via statistics. Integral-field or multi- 
object spectroscopy down to faint magnitudes becomes essential. 

Finally, we already mentioned a few times that a set of spectro- 
scopically confirmed DLAs can be used to improve this statistical 
procedure. In order to establish how large a sample should be to de- 
termine /(&), we extract randomly a subset of DLAs from the SPH 
simulation. While a large number of DLAs (~ 50-100 objects) is 
required to precisely reconstruct /(&), a smaller sample (~ 20-30 
objects) is sufficient to constrain the peak and the tail of the impact 
parameter distribution. Therefore, the present and other ongoing 
attempts to enlarge the sample of known DLAs may provide soon 
enough objects to improve this Bayesian procedure. 



5.3 Results for J211444-005533 and J073149+285449 

Before we compute Paia,i for our candidates, we remark on two 
points that have already been discussed. (1) Being a statistical anal- 
ysis, this classification is subject to individual failures and carries 
all the assumptions and uncertainty related to the choice of the pri- 
ors. (2) Due to the nature of our experiment, objects detected in the 
u band images at low impact parameters are at z < zlls- There- 
fore, the high-redshift LLS and the QSO host galaxy are not in- 
cluded in this analysis and they do not contribute to additional con- 
fusion. Additional confusion can arise from other absorbers (e.g. 
Mg n) in the line of sight. 

Bearing these caveats in mind, but encouraged by the pos- 
itive results from our tests, we apply the above statistical pro- 
cedure to the galaxies detected in the fields of J21 1444-005533 
and J073 149+285449. Reliabilities and probabilities of galaxy- 
absorber association are listed in Table |5] For the z ~ 2.919 
DLA towards J21 1444-005533, we use the templates OB0 and 
SB3, while for the z ~ 2.686 DLA towards J073 149+285449, we 
adopt OB0 and OB2.5. For the DLA towards J21 1444-005533, our 
statistics indicate that none of the detected targets has a probability 
greater than 35% of being associated with the DLA. Conversely, in 
the case of J073 149+285449, there is a probability of ~ 60% that 
object A is associated with the DLA. Adding the fact that the prob- 
ability of being an interloper is less than 10% from the frequentist 
analysis, we consider J073149+285449-A an excellent candidate 
for the DLA host galaxy. We are presently trying to confirm the as- 
sociation in J073 149+285449 through spectroscopy in the Lya line 
and UV continuum. 



6 SFR CALIBRATION 

The metal lines observed in DLAs support the idea that star for- 
mation activity has occurred at le ast previously in t hese objects, 
enriching the surrounding gas (e.g. IWolfe et al.ll2003h . A key issue 
in DLA studies is the star formation rate in these objects and its 
distribution across the ISM of the host galaxy. 

At 2 = 3, the u-band filter covers the rest frame wavelengths 
740 A < A < 1000 A, where a galaxy's emission is expected to 
be dominated by massive (M > lOM©) and short-lived (tnf e < 
2 x 10 7 yr) stars. To recover the emitted rest-frame UV flux F UlC , 
we apply a simple K-correction to the observed flux _F„ j0 under the 
assumption that the SED is not a sensitive function of wavelength 
in the FUV region: 



(18) 



where cLl is the luminosity distance to z a . We also correct for ab- 
sorption by the IGM, using an updated calculation of the effective 
opacity r e ff computed from a recent determination of f(Nm, X) 
( Prochaska et al. 2009) over a large interval of H I column densities 



(10 12 - 10 22 ' 5 cm" 2 ) at z ~ 3.7. 

We compute the transmission T igm to FUV photons consider- 
ing the first 35 lines in the Lyman series as: 

T igm (y) = exp[-T Bf f(v)] , (19) 

where the effective opacity r e // is defined by 

r e // = y) r [ /(iV H i,2)(l-e- T -)diVHidz, (20) 

with r c ,i the optical depth of an individual cloud at the frequency 
v — v R {\ + z) computed for the i-th element of the Lyman series 
with frequency Vj. To relate f(Nm,X) derived at zo = 3.7 by 
iProchaska etall d2009h to f(N H i, z) at an arbitrary redshift, we 
assume 



f(N Hl ,z) = f(N in ,X)- 



dX ( 1 + z 



dz \ 1 + zo 



(21) 



(22) 



where 

dX_ = H (l + z) 2 
dz ~ H{z) ' 

In Eq. (ED, 

we model the redshift evolution in the interval 2 < 
2 < 4 with a density dependent power law i ndex. Specifically , 
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lO'Meara et afll2007h , and 7 = 1.27 for N H i > 2 x 10 20 cm 
(Rao et al. 2006). A plot of the IGM transmission at redshifts 2, 3 
and 4 is presented in the top panel of Figure [TT] (solid lines), to- 
gether with the u (blue dashed line), V (green dashed line) and R 
(red dashed line) LRIS filter transmission curves. Comparing the 
results of our calculation with those from a similar analysis by 
Madauj d 19951) (dotted line) at z = 3, we find that the major dis- 
crepancy arises for high order lines in the Lyman series. In fact, 
the main contribution to the opacity at these wavelengths comes 
mostl y from optical ly-thick absorbers, which are more numerous 
in the lMadaul d 19951) calculation than the estimate of Prochaska et 
al. A slight offset is also visible in the Ly man-q line, w here our 
opacity is 2% higher than that computed bv lMadaul ( fl995l) . Despite 
these differences, the two calculations for the transmission through 
a broad-band filter agree to within a few percent in the interval 
2 iS z iS) 4. We derive the IGM correction Cigm to an observed 
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Table 5. Bayesian statistics for DLA candidates in our fields. For J21 1444-005533 we use the priors OB0 (column 3,4) and SB3 (column 5,6). For J21 1444- 
005533 we adopt priors OB0 (column 9,10) and OB2.5 (column 11,12). a - Rdi a expresses the probability that a candidate is the correct identification and not 
an unrelated foreground source, b - Pdi a i is the probability that the i-th galaxy is uniquely associated with the DLA. 
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Figure 11. Top panel: The IGM transmission at z = 2, 3 and 4 (solid lines), 
together with the u (blue dashed line), V (green dashed line) and R (red 
dashed line) LRIS filter transmission curves. Superi mposed is a comparison 
with the result of a similar analysis by Madau 1 1995) (dotted line) for 2 = 3. 
The major discrepancy arises for higher-order lines in the Lyman series 
due to the different numbers of optically-thick systems included in the two 
calculations. Bottom panel: The final IGM correction in magnitude as a 
function of the DLA redshift for the u- (solid line), V- (dotted line) and 
ij-band (dash-dotted line) filters. 

it-band flux by integrating the product of the effective opacity and 
the u-band transmission curve g u (X) 

Cigm = J e~ T ^f w g u {X)dX. (23) 

The final values, in magnitudes, are presented in the lower panel 
of Figure QT| as a function of redshift: the solid, dotted, and dash- 
dotted lines are for the u-, V- and i?-bands, respectively. 

Finally, we convert the UV luminosity L v (erg s _1 Hz -1 ) 



into an SFR (Mm yr x ) using the calibration at 1500A from 
Madau et al Jdl998r) . divided by 1.58 to account for a Chabrier IMF 
dSalim et al.ll2007t) : 

SFR = 7.91 x 1(T 29 L„ . (24) 

There are some caveats to this determination of the a bsolute SFR. 
First, it is not obvious whether the conversion of Mad au et al.l 
Jl998l) is applicable at wavelengths lower than 1500A. In fact, the 
presence of molecular gas in the ISM can significantly increase 
the opacity of the UV photons in the Lyman- Werner band, result- 
ing in an underestimated SFR if the star forming regions of DLAs 
are rich in molecules. Unfortunately, it is very difficult to prop- 
erly account for this effect. In addition, the SED is most likely 
frequency-dependent, especially towards harder UV frequencies. 
C omparing fluxes at 10 00 A and 1500 A with the SED templates 
of lGrazianetal] J2006|) . we find that fiooo /^i5oo ~ 1 within a 
factor of 2 in scat ter. Ad hoc calibrations can be computed, as done 
for example bv lChristensen et al. ( 2009). However, the same order 
of uncertainty is associated with different choices for the template 
age at a fixed metallicity. As a consequence, we infer that the abso- 
lute value of the SFR is uncertain at the level of a factor of 2. This 
is without considering additional complications due to dust extinc- 
tion. Contamination from Lya emission is not an issue for our two 
DLAs, as the Lya line by design does not lie within the u band. 

Applying our calibration, we derive for DLA 
J073149+285449-A an unobscured SFR of (5.4 ± 0.5 ± 2.7) hf 2 2 
Mq yr -1 , corrected by a factor of 1.9 due to IGM absorption. 
Here, the first uncertainty refers only to the error in the flux 
measurement, while the second one refers to a 50% uncertainty 
on the star formation calibration, combined with a 10% error 
from the IGM correction. To estimate the total star formation rate 
corrected for dust, one can include a factor o f ~ 2.3 , as suggested 
by X-ray measurements dReddv & Steidel |2004|) for galaxies 
with SFR < 20 hf 2 2 M yr -1 . For DLA J21 1444-005533, we 
derive an upper limit to the unobscured SFR of 1.4 hf 2 2 Mq yr -1 , 
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Figure 12. A comparison of SFRs and s izes between LBGs and DLAs. 
Crosses are for a sample of LBGs from iBouwens et alj 12004) while the 
blue triangles represent previously known DLA galaxies; the red circles 
are for DLA J073149+285449-A and the 3ct upper limit for the DLA to- 
wards J21 1444-005533. Also ov erplotted (in gTeen dash -dotted lines) are a 
typical L, and 1/10L, LBG from lReddv & Steidell <2009h a nd the expected 
SFR in DLAs as inferred from fC 11*1 bv lWolfe et al J feooj) (yellow dashed 
lines). Upper limits on the sizes are conservative estimates using the seeing 
FWHM for ground-based observations. 



computed at 3 a C.L. in a 1" aperture, and corrected by a factor 
4.4 for IGM absorption. 



7 DISCUSSION 

7.1 Star formation rate in DLAs and LBGs 

One of the outstanding questions in DLA studies is whether DLAs 
arise from the extended hy drogen reservoirs surrounding LBGs 
(e.g. lM0ller&Warr"eT]|l998l) . This idea is in agreement with the 
finding that Lya emission is more spatially extended than the UV 
emission, suggesting that photons from newly formed stars are res- 
onan tly scattered before they can escape from the wings of the Lya 
line dRauch et al]|2008l) . It is also consistent with the finding that 
DLAs have too low an in situ star formation rate to justify the chem- 
ical enrich ment and the cool ing rate inferred from [Cn*] absorp- 
tion lines0 dWolfeetal.ll2003h . The possible link between DLAs 
and LBGs has been the subject of several studies, comparing the 
sizes , morphology and luminosities of the two classes of objects 
(e.g. iFvnbo et allll999l : llVfoller et alj|2002h . It should be empha- 
sized, however, that only one DLA, the z ~ 1.92 system towards 
Q2206— 1958, has so far been directly shown to be associated with 
an LBG. 

In Figure [12] we present an updated comparison of SFRs 
and sizes between LBGs and DLAs, using the SFRs derived 
for a sample of LBGs at z ~ 3 by Bouwe ns et al. (private 
communication; see also Bouwens et al. 2004). The LBGs are 
shown with crosses, while the blue triangles represent DLAs with 



8 It should be noted that the [C II*] model of IWolfe etal] 120031) is incon- 
sistent with the H I temperature distributio n in DLA s , as derived from H I- 
21cm absorption studies. See lKanekar & C hengalur (2003); Kanek ar et alj 
l2009h . 



spectroscopically-confirmed hosts (see Appendix [B). The red cir- 
cles refer to candidate DLA 1073 149+285449- A and the 3a upper 
limit for DLA 1211444-005533, from this work. We also overplot 
with green dash-dotted lines a typical L* and 1/10L, LBG from 
iReddv & Steidell J2009I) ^ = -20.97 at z ~ 3). Finally, 
the dashed lines indicate the expec ted range of SFRs in DLAs, as 
inferred from the [Cn*] model of I Wolfe et alj (|2003). The lower 
value is for a cold neutral medium (CNM), while the higher one 
is for a warm neutral medium (WNM); in both cases we assume 
a disk size of 100 kpc 2 to convert the SFR surface density into an 
integrated value. Regarding the DLA sizes, we assume the seeing 
FWHM as an upper limit on the half-light radius for our determina- 
tions; for previou sly-known DLAs, we quote half-light radii from 
M0lle retal] d2002l) for 3 galaxies, while we assume an FWHM of 
0.8" as a conservative upper limit for the remaining cases. All of 
the quantities presented here have been rescaled to match our SF 
calibration, IGM correction and cosmology. This allows for a rela- 
tive comparison which is not affected by the systematic uncertainty 
on the absolute value for the SFR. 

Interestingly enough, J073 149+285449- A lies in the same in- 
terval of SFR observed for the earlier DLAs. Although the statistics 
are still limited, as noted in previous studies, the detected DLAs ap- 
pear to be consistent with the SFR distribution of LBGs, at least for 
the redshift interval considered. The only exception is the bright 
DLA towards Q0151+048 A, whose gas is k nown to be photo- 
ionized by a nearby QSO dFvnbo et alJll999T) . Due to the small 
sample and the many upper limits for sizes, we refrain from ad- 
ditional discussion here. We only comment on the fact that DLAs, 
being H I selecte d galaxies, are expec ted to span a wide range in UV 
luminosity (e.g. lPontzen et al.ll2008l) . However, the optical follow- 
up of these objects imposes an additional selection bias since only 
the most luminous DLAs can be observed. We remark that non- 
detection from our imaging can only be attributed to the sensitivity 
limits and hence will directly constrain the DLA luminosity func- 
tion. 

Finally, considering the SFR surface density expected from 
the [Cn*] cooling rate in DLAs, we note that a disk of 100 kpc 2 
(lower yellow dashed lines in Figure 1 1 2b sig nificantly underesti - 
mates the observed SFR, for the CNM model o fl Wolf e et all J2003h . 
For such a scenario, DLAs should typically arise in galaxies with 
star-forming regions extended over more than ~ 600 kpc 2 , simi- 
lar to present day disks. If this is the case, a significant number of 
DLAs may be at low-surface brightness. Conversely, if DLAs orig- 
inate exclusively from a WNM, model and observations agree for 
more compact disks, suggesting a typical DLA size of ~ 80 kpc 2 
(close to the higher yellow dashed line in Figure [L2t. We hope to 
improve our knowledge of the DLA size through our upcoming 
HST (PI: O'Meara, ID 11595) observations. 



7.2 The HI distribution at z > 2 

One of the most valuable results we hope to achieve is to directly 
trace how neutral gas is distributed around star-forming galaxies at 
2 > 2. This will address the fundamental question of the origin of 
DLAs and ultimately provide important constraints for models of 
galaxy formation. Whereas next generation arrays such as the SKA 
will allow direct imaging of individual (albeit large) galaxies at z ~ 
2 in the Hi 21cm line, our analysis should, given a large enough 
DLA sample, constrain the radial distribution of H I in DLAs as a 
function of the projected distance to the star-forming center. 

Evidence for an anti-correlation between the impact parame- 
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Figure 13. The H I column density as a function of the impact parameter for 
all confirmed DLA host galaxies (triangles) and for our candidate towards 
J073 149+285449 (circle). Also overplotted with blue dashed lines are (from 
left to right) the 50 , 75 , and 90 th percentiles of the impact parameter as 
a functi on of the H I column density in local disk galaxies from Zwaan et al. 

The same distribution scaled to z = 2.35 according to a simple 
model is shown with red dotted lines. Provided that previous observations 
were biased against small impact parameters, the H I in high-z DLAs seem 
more extended than what expected from single disks from simulations. 



ter and the H I column density has been obt ained by previous stud- 
ies aimed to identify D LA host galaxies er & Warrenll 19981 ; 
Christ ensen et alj|2007l) . Similar results emerge from absorption 
line studies in Q SO pairs or multiple images of lensed quasars 
dMonier et aT]|2009l) . Once again, this is consistent with a model 
in which DLAs arise in gas located around star-forming galaxies. 
In Figure Q~3] we show the 6/Nh i distribution for all DLAs whose 
impact parameters have been measured (triangles), along with our 
candidate DLA J073 149+285449- A (circle). Our system lies within 
the population of confirmed DLA hosts, supporting previous claims 
for such an anti-correlation. 

Figure[T3]also overlays the above DLA data on the conditional 
probability of the impact parameter as a f unction of the H I colu mn 
density, derived for local disk galaxies bv lZwaan et alj j2005al . see 
their figure 15). After inverting b = 6(Nhi), the 50%, 75% and 
90% percentiles of this distribution are plotted with blue dashed 
lines in the figure. The probability is found to increase with in- 
creasing impact parameters, as seen tentatively in the high-z DLA 
sample. We also show (red dotted lines) the values expected after 
scaling the impact parameter (using the toy model of section [5".2.2t 
to z = 2.35, i.e. the median redshift of the observed DLAs. Com- 
pared with the local H 1 21cm data, we find that almost all the high- 
z DLAs lie within 75% of the Hi distribution seen in present-day 
galaxies. Conversely, DLAs are seen to lie outside the 75th per- 
centile of the expected H I distribution for models in which the gas 
at high redshifts follows the dark matter halo potential. Interest- 
ingly, this result is consistent w ith the gas distribu tion of lower 
redshift (1 < z < 2) absorbers dMonier et alH 2009). We empha- 
size that some of the previous observations are likely to have been 
biased against DLAs at small impact parameters; further, our toy 
model is based on very simplistic assumptions. Given these caveats, 
we infer that gas in high-z DLA galaxies is distributed on larger 
scales than those inferred from the simulated disks. This is sug- 
gestive that the null hypothesis that gas in high-z DLA galaxies 
has not changed its distribution from z = 3 to the present epoch 



is consistent with the present data, in agreem ent with the absence 
of evo lution in /(A*hi, X), as pointed out by IProchaska & Wolfd 
(2009). While this agrees with the notion that DLAs are large disks 
at high redshifts, a model in which gas-rich satellites surround a 
central star-forming galaxy may explain the observed distribution 
equally well. In fact, even if gas is distributed in small disks inside 
individual halos, multiple satellites around a central galaxy could 
be responsible for an overall extended H I distribution. 



8 SUMMARY AND FUTURE PROSPECTS 

We have presented a new imaging programme, aimed to increase 
the sample of DLA host galaxies known between z = 2 and 
z = 3, based on an updated version of the dropout technique of 
lO'Meara et al] (2006). We imaged QSO sightlines containing two 
optically-thick absorption line systems, using the Lyman limit of 
the higher-redshift absorber to block the quasar light at wavelengths 
shortward of the limit. This allows the rest-frame UV emission 
from the lower-redshift DLA to be imaged at wavelengths between 
the Lyman limits of the two absorbers, without any contamination 
from the QSO light. Using this criterion, we have selected a sam- 
ple of 40 targets for HST and ground-based observations, which 
are currently being carried out. Once the locations of the candi- 
date host galaxies have been determined from imaging, follow-up 
long-slit spectroscopy in either the Lya or the Ha lines will allow 
a measurement of the redshift of the DLA host. IFU spectroscopy 
will also reveal properties of the ISM (e.g. the metallicity or SFR 
via emission lines) from regions that are not probed by the QSO 
absorption lines. A significant advantage of the DLAs targeted in 
our survey is that, even for spectroscopy, the QSO contamination 
disappears blueward of the Lyman limit of the higher-redshift ab- 
sorber. Any continuum detected in the spectrum comes only from 
foreground sources which are in the slit, increasing the probability 
of confirming the redshift of the galaxies associated with the low-z 
DLA. 

In this paper, we have presented preliminary results from 
the application of this technique to sightlines towards two QSOs, 
J21 1444-005533 and J073 149+285449, each hosting two high col- 
umn density absorbers. Our Keck-LRIS u-band imaging of these 
fields achieved a depth of ~ 29 mag. at la, resulting in the de- 
tection of a number of candidates for the host galaxy in each case. 
Follow-up spectroscopic studies are ongoing to confirm the galaxy- 
absorber associations for these and other candidate hosts identified 
in the programme. 

To pre-select galaxies for spectroscopy or for those cases (e.g. 
very faint galaxies) where spectroscopic confirmation will be ex- 
pensive, we have also proposed a statistical approach based on 
both number count statistics and a Bayesian treatment. This pro- 
cedure is based on several simple assumptions and it does not aim 
to correctly identify each DLA, but provides a means to build a 
statistical sample of DLA galaxies that is representative of the en- 
tire population. We provide a general Bayesian identification pro- 
cedure that can be applied to identify the galaxy counterparts of 
different types of absorption line systems, including DLAs. Due to 
the scarce present information on the impact parameter distribu- 
tion in DLAs, we derived two different priors on th is distribution, 
based on SPH simulations from lPontzen et a l. ( 2009) a nd H l-21cm 
observations of local galaxies from THINGS dWalter et alj|2008h . 
The second prior is computed under the assumption that the H I col- 
umn density distribution d oes not evolve significantly with redshift 
dProchaska & Wolfd2 009). while the first is more consistent with a 
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hierarchical picture of galaxy assembly. An observational determi- 
nation of this prior will be possible once a larger sample (> 20 ob- 
jects) of spectroscopically-confirmed DLA will be available, from 
the present and other ongoing surveys. 

We have tested the proposed statistical approach on a sample 
of five DLAs and one SLLS whose host galaxies have been spectro- 
scopically confirmed. For all DLAs, the procedure correctly identi- 
fied the galaxy giving rise to the absorber; conversely, we could not 
identify the galaxy responsible for the sub-DLA (with Nm= 10 20 
cm -2 ), perhaps because our prior has been calibrated for higher H I 
column densities. Our test suggests that with this statistical method 
we can select a sample of bona fide DLAs complete to 60%-85% 
(depending on the required confidence level) with a contamination 
from interlopers around 15%-35%. 

We then applied the proposed identification procedure to the 
candidate hosts detected in Keck-LRIS images of the fields of 
J21 1444-005533 and J073 149+285449. For J21 1444-005533, no 
galaxies is found to be associated with the DLA at high signifi- 
cance, while for J073 149+285449 we found a good candidate with 
60% probability of being the DLA and less then 10% probability of 
being an interloper. This system is at a projected distance from the 
quasar b = 1.54", corresponding to an impact parameter of 11.89 

kpc at z — 2.686. Spectroscopic confirmation of this candidate 
is now being carried out. Converting the rest-frame UV emission 
into a SFR, we measure for candidate DLA J073149+285449-A an 
unobscured SFR of 5.4 ± 0.5 ± 2.7 hf 2 2 M Q yr" 1 , where the first 
uncertainty refers only to the error in the flux measurement, while 
the second one refers to a 50% uncertainty on the star formation 
calibration, combined with a 10% error from the IGM correction. 
Conversely, we place the 3a upper limit of 1.4 hf 2 2 Mq yr" 1 on 
the SFR of the DLA towards J21 1444-005533. 

The SFR properties of our candidate and DLAs with identi- 
fied hosts from the literature appear consistent wi th those of two 
independent samples of LBG s at similar redshifts dBouwens et al.l 
120041 : iReddv &~S teidel 2009), supporting earlier suggestions that 
the brightest DLA galaxies and LBGs might be overlapping galaxy 
populations. The impact parameter b measured for the new can- 
didate DLA J073149+285449-A is also consistent with an anti- 
correlation between impact parameter and Hi column density Nhi, 
as suggested by earlier studies of DLAs and local galaxies. Com- 
paring the b-Nm distribution with the conditional probability of 
the impact parameter as a function of the Hi column density de- 
rived in local disk galaxies, we find that most DLAs lie within the 
15 th percentile. This is consistent with the an absence of redshift 
evolution i n f(Nni,X) at z > 2, c onsistent with the distribution 
at z = jProchaska & Wolfdl2009l) . but it could be equally well 
explained with systems composed by a central star-forming galaxy 
and multiple gas-rich satellites, in concordance with cold dark mat- 
ter simulations. 

With the direct images of high-z DLA host galaxies that 
will be available from our HST/Keck survey we aim to enlarge 
the current sample to answer some fundamental questions on the 
absorbers. What is the typical star formation rate in DLAs at 
z ~ 2 — 3? How are gas and stars distributed in the absorbers at 
these redshifts? Are rotationally-supported disks already in place 
at z ~ 3? Or are DLAs associated with merging gas clumps? How 
do the metallicity, the dust-to-gas ratio, and the Hi column den- 
sity depend on the impact parameter? Is there an SFR/metallicity 
or a mass/metallicity relation in DLAs? Over the next few years, it 
should hence be possible to obtain a comprehensive picture of the 
properties of high redshift DLAs, providing new insights into their 
role in the broad picture of galaxy formation and evolution. 
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APPENDIX A: PHOTOMETRIC QUANTITIES 

Quantities derived from the photometry of galaxies detected in the 
fields J21 1444-005533 and J073 149+285449 are listed in Table 43 
Relative separations from the quasar are in columns 2 and 3, in arc- 
sec. Total magnitudes, uncertainties and signal-to-noise ratios for 
the u band are in columns 4, 5, and 6, respectively. The remaining 
columns list the magnitudes computed in circular apertures in the 
R, V, and / bands. Additional details are provided in section [3^21 
All the listed magnitudes have been corrected for Galactic extinc- 
tion. 



APPENDIX B: PREVIOUS IMAGING STUDIES OF 
HIGH-Z DLAS 

In this appendix, we review all previous emission studies of z > 
2 DLAs that are known to the authors. Table E[2] summarizes the 
properties of the six such DLAs for which a galaxy counterpart 
has been spectroscopically confirmed. The z ~ 2.04 DLA towards 
PKS0458— 02 is included in this list although direct imaging of 
the as sociated galaxy is not available (See, however. IWarren et al.l 
l200lh . For this object, the impact parameter has been computed 
via long-slit spectroscopy at two orientations. Conversely, the table 
does not include DLAs whose emission lines have been detected in 
spectra, but for which no spatial information is available; these are 
discussed separately below. The columns of Table E[2]are: 

(1) The quasar name and a reference to the first identification of 
the galaxy counterpart. 

(2) The quasar redshift. 

(3) The DLA redshift. 

(4) The impact parameter, in " . 

(5) The impact parameter, in kpc. 

(6) The H 1 column density. 

(7) The published SFR. 

(8) The SFR diagnostic used for column (7). 

(9) The unobscured SFR, computed from the UV emission with 
our calibration, after applying a K-correction and an IGM correc- 
tion. 

(10) The Lya flux in 10" 17 erg cm" 2 s" 1 . 

(11) The [OIII] flux in 10~ 17 erg cm" 2 s" 1 . 
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Table 1. Table of photometric quantities. Asterisks indicate 3 < S/N < 5. Magnitudes with S/N < 3 (including non-detections in a given band) are not 
listed in the table. The listed values have been corrected for Galactic extinction. See Appendix[A]fbr additional details on each entry. 
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0.09 


14.3 


25.84 


0.08 


24.64 


0.06 


24.65 


0.05 


24.69 


0.09 


D 
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0.24 


4.6 


















E 


6.35 W 


3.92 N 
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0.06 


24.97 


0.09 


I 


3.81 E 


8.82 S 


24.44 


0.06 


41.3 


24.75 


0.06 


24.63 


0.06 


24.74 


0.05 


24.63 


0.08 


L 


7.88 E 


2.93 S 


27.93* 


0.30 


3.7 


















M 


8.35 E 


0.374 N 


25.33 


0.09 


14.9 


26.10 


0.08 


25.98 


0.13 


25.99 


0.11 


26.02 


0.19 


N 


8.87 E 


2.25 N 


25.55 


0.09 


14.6 


25.76 


0.07 


25.81 


0.13 


25.71 


0.10 


25.94 


0.20 


O 


4.06 E 


3.11 N 


28.61* 


0.34 


3.2 


















P 


6.21 E 


10.31 S 


27.28 


0.22 


5.1 


















Q 


5.81 E 


10.38 S 


27.37* 


0.25 


4.3 


















R 


3.29 W 


11.57 S 


26.99 


0.16 


7.3 


27.04 


0.13 


27.13* 


0.35 


27.23* 


0.32 


27.25* 


0.54 


S 


12.47 E 


3.46 S 


24.35 


0.06 


37.3 


24.83 


0.06 


24.21 


0.05 


24.46 


0.05 


24.23 


0.07 


T 


12.05 E 


1.57 S 


26.89 


0.21 


5.4 


26.75 


0.20 


26.05* 


0.25 


25.83 


0.18 


25.84* 


0.28 



J073 149+285449 



A 


1.28 E 


0.86 S 


25.49 


0.11 


13.3 -------- 


B 


2.44 E 


1.51 S 


26.05 


0.16 


7.4 ....... 


C 


4.50 E 


0.58 S 


27.10 


0.22 


5.2 ------- - 


D 


3.97 W 


1.73 S 


25.11 


0.10 


15.0 -------- 


E 


4.61 W 


1.10N 


26.17 


0.16 


JA ------- - 


F 


5.21 W 


3.59 N 


24.67 


0.09 


20.2 -------- 


G 


8.71 W 


5.80 N 


27.26* 


0.24 


4.7 ------- . 


H 


11.19 W 


5.98 N 


25.13 


0.10 


14.3 -------- 


I 


2.44 E 


12.14 N 


22.86 


0.07 


64.6 - 


L 


0.26 E 


8.59 S 


26.39 


0.17 


6.8 ------- - 


M 


0.58 E 


9.90 S 


27.02 


0.23 


5.0 ------- - 


N 


11.81 E 


4.63 N 


27.72* 


0.24 


4J ------- . 



Table 2. Properties of DLAs with confirmed galaxy associations. Individual references are given in the main text. References listed with the QSO name refer 
to the first galaxy-absorber association. 



Name 


Zqso 


Zdla 


l>as 


bp 


Nhi 








O 


(kpc) 


(cm -2 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


1 Q0151+048A 


1.922 


1.934 


x 0.93 


7.59 


20.36 


3 PKS0458-02 


2.286 


2.039 


3 0.30 


2.44 


21.65 


4 PKS0528-250 


2.797 


2.811 


5 1.17 


8.92 


21.35 


7 Q2206- 1958a 


2.559 


1.920 


7 0.99 


8.09 


20.65 


2 Q2206-1958b 


2.559 


1.920 


1.23 


10.05 


20.65 


8 2233.9+1318 


3.298 


3.150 


2.51 


18.52 


20.00 



SFR F(Lya) ¥{[OIII\) Magnitude r w 

(Mgjyr- 1 ) (cgs) (cgs) (") 

(7) (8) (9) (10) (11) (12) (13) (14) 

- - 18.45 x 191 - J 22.9 u 



>1.5 


Lyce 




3 5.4 








4 4.2 


Lyo 


3.17 


7 7.4 




7 25.43 


V 


5.7 


UV 


3.23 


(i 26 


7.6 


24.69 


V 


4.2 


UV 


2.41 




10.7 


25.01 


V 


5.9 


UV 


2.98 


8 6.4 


6.8 


25.75 


V 



(12) The broad-band magnitude, in AB mag. 

(13) The filter used for broad-band photometry in column (12). 

(14) The half-light radius, deconvolved for the PSF. 

Individual references are: 



Fvnboetal. J 1999 


) 


Weatheiiev et al. ( 


2005) 


M0ller et al. (2004) 



(4) lM011er&Warrer] | |l993l) 

(5) |M0ller & W arrer] jl998h 

(6) IWolfeetalj j2005h 

(7) lM0lieretal.U2OO2h 

(8) lDiorgovskietai] ( ll99fjh 

Where not explicitl y specified, data are taken from 
IWeatherlev et al.l d2005h and references therein. 
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Three other DLAs with confirmed redshifts can be found in 
the literature. A DLA at z — 3.407 has been reported to exhibit 
Lya emission in the spectrum of PC0953+4749 by Bunker (HST 
Proposal ID 10437), but no additional imaging ha s been published. 
Djorg ovski et al. report (private communication in Weatherley et al. 
1 20051) the detection of emission lines from a DLA at z = 4.1 to- 
wards the quasar DMS2247-0209. The most likely association is a 
galaxy at b = 3.3", whose inferred SFR is ~ 0.7 Mo yr . Fi- 



xy_ 

nally, iLeibundgut & Robertson! d 19991) report the detection of Lya 
emission which is spatially extended in the absorption trough of a 
DLA at z — 3.083. Due to the large velocity offset between ab- 
sorption and emission redshifts, the emission feature could result 
from an object not associated with the DLA. 

We next summarize results from other studies that have either 
identified possible galaxy counterparts o f high-2: DLAs, or placed 
upper limits on the host luminosity. lAragon-Salamanca et al.l 
dl996h reported deep near-IR images of 10 fields containing DLAs, 
finding two L\ DLA candidates at small impact parameters (~ 
1.2"). In a long-slit K-band sp ectroscopic search f or Ha emis- 
sion from eight DLAs at z > 2, Bun ker et al 1 19991) obtained 3<t 
limits in the range 5.6 - lSh~ 2 Mm vr^. lKulkarni et all teOOOt 
l200lh used HST NICMOS images of two DLAs at z = 1.892 
and z = 1.859 to place 3<r upper limits of 4.0/if 2 M@ yr -1 
and 1 .3fe7n 2 Mq y r -1 , re spectively, on the DLA star formation 



rates. lEllison et alj feOOll) found a possible association between 
the z = 3.387 DLA towards the quasar Q0201+113 and a 0.7L* 
galaxy at an impact parameter of 15/i _1 kpc. In a deep narrow- 
band imaging survey of six fi e lds w ith heavy-element quasar ab- 
sorption lines, Kulk arni et al] ( l2006h searched for Lya emission 
from absorbers at z = 2.3 — 2.5, obtaining SFR limits of ~ 
0.9 — 2.7 Mq yr -1 , assuming no dust attenuation of the Lya line. 
HST photo metry for seve ral galaxies det ected in 18 DLA fields is 
presented in lWarren et alj J2001). Finally, Christ ensen et alj J2007I) 
used IFU spectroscopy in the Lya line to identify candidate hosts 
for six DLAs at z > 2. 
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